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Description 

[0001] The present invention relates generally to the fields of molecular biology and nucleic acid chemistry. More 
specifically, it relates to methods for the amplification of long nucleic acid sequences by the polymerase chain reaction. 

5 [0002] The polymerase chain reaction (PCR), a powerful tool for the amplification of nucleic acid sequences, is 
disclosed in U.S. Patent Nos.4,683,202; 4,683,195; 4,800,159; and 4,965,188. In its simplest form, PCR is an in vitro 
method for the enzymatic synthesis of specific DNA sequences, using two oligonucleotide primers that hybridize to 
complementary strands and flank the region of interest in the target DNA. A repetitive series of reaction steps involving 
template denaturation, primer annealing, and the extension of the annealed primers by a DNA polymerase results in 

10 the geometric accumulation of a specific fragment whose termini-are defined by the 5' ends of the primers. PCR Is 
capable of producing a selective enrichment of a specific DNA sequence by a factor of 10 9 . The PCR method is also 
described in Saiki et al., 1 985, Science 230: 1350-1354. 

[0003] PCR has been widely applied in molecular biology, molecular evolution, medical genetics, population genetics, 
forensic biology, and genome mapping and sequencing projects. However, current PCR are limited in the size of the 

1& region of DNA that can be amplified reliably. C »' 

[0004] Attempts to overcome the length limitations of PCR are reported in Glukhov et al., 1991, Molek. Biol. 25 > 
1602-1610; Kainz etal., 1992, Anal. Biochem.202: 46-49; Ohlerand Rose, 1992, PCR Meth. Applic. 2:51-59; Ponce 
and Micol, 1992, Nucl. Acids Res. 20:623; and Rychlik et al., 1990, Nucl. Acids Res. 18: 6409-6412. Although ampli- 
fications of 5-15 kb sequences were achieved, the reported yields of the longer products were low. J 

20 [0005] Efforts to optimize PCR conditions led to the conclusion that the use of Tth DNA polymerase may be favorable 
when amplifying larger targets. The amplification of PCR products larger than 12.2 kb, however, was not enabled when ; 
using Tth polymerase (Ohier etal., 1993, PCR Meth. Applic. vol. 3, No. 2, p. 115-119): 

[0006] PCR methods capable of amplifying long nucleic acid sequences would facilitate genomic mapping and se- 
quencing as well as molecular cloning through the amplification of long, low-copy insert material, and by making pos- 

25 sible the assembly of larger recombinant constructions in PCR-based mutagenesis. There remains a need for methods ^ 
that will enable PCR amplification of targets of at least 25 kb with high yields. f ^ 

[0007] The present invention provides improved methods and reagents for the PCR amplification of long DNA targets.' 
[0008] One aspect of the invention relates to combinations of thermostable DNA polymerases which are useful in 
the methods of the present invention. The combinations consist primarily of Thermus thermophilus DNA polymerase, 

30 a highly active thermostable DNA polymerase that does not exhibit 3'-to-5 ( exonudease activity, and secondarily of 
either Thermococcus titoralis, Pyrococcus species GB-D, or Thermotoga mahtima DNA polymerase, all thermostable 
DNA polymerases that exhibit 3'-to-5' exonudease activity. 

[0009] Another aspect of the invention relates to a buffer useful for carrying out the amplification of long targets. 
[0010] Another aspect of the present invention relates to PCR amplifications using the specific combinations of ther- < 
35 mostable enzymes described above. The reaction conditions are specified so as to enable the amplification of nucleic 
acid target sequences of up to 42 kilobases in length. 

[0011] Another aspect of the invention relates to kits comprising reagents useful in carrying out the methods of the 
present invention. Such kits comprise a combination of thermostable DNA polymerases as described above and, op- 
tionally, additional amplification reagents which are useful in the methods of the present invention. 

40 [0012] To aid in understanding the invention, several terms are defined below. <. £ 

[0013] The term "amplification reaction mixture" , as used herein, refers to an aqueous solution comprising the various 
amplification reagents used to amplify a target nucleic acid. The reagents include primers, enzymes, aqueous buffers, 
salts, target nucleic acid, and deoxynucleoside triphosphates (both conventional and unconventional). Depending on 
the context, the mixture can be either a complete or incomplete reaction mixture. 

45 [0014] The terms "nucleic acid" and "oligonucleotide", as used herein, refer to primers, probes, and oligomer frag- 
ments to be detected, and shall be generic to potydeoxyribonucleotides (containing 2-deoxy-D-ribose), to polyribonu- 
cleotides (containing D-ribose), and to any other type of polynucleotide which is an N -glycoside of a purine or pyrimidine 
base, or modified purine or pyrimidine bases. There is no intended distinction in length between the term "nucleic acid" 
and "oligonucleotide", and these terms will be used interchangeably. These terms refer only to the primary structure 

so of the molecule. Thus, these terms include double- and single-stranded DNA, as well as double- and single-stranded 
RNA. 

[0015] Because mononucleotides are reacted to make oligonucleotides in a manner such that the 5' phosphate of 
one mononucleotide pentose ring is attached to the 3' oxygen of its neighbor in one direction via a phosphodiester 
linkage, an end of an oligonucleotide is referred to as the "5' end" if its 5' phosphate is not linked to the 3* oxygen of a 
55 mononucleotide pentose ring and as the "3' end" if its 3' oxygen is not linked to a 5' phosphate of a subsequent mono- 
nucleotide pentose ring. 

[0016] The exact size of an oligonucleotide depends on many factors and the ultimate function or use of the oligo- 
nucleotide. Oligonucleotides can be prepared by any suitable method, Including, for example, cloning and r striction 
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of appropriate sequences and direct chemical synthesis by a method such as the phosphotriester method of Narang 
et al., 1979, Meth. Enzymol. 68:90-99; the phosphodi ster method of Brown et al., 1979, Meth. Enzymol. 68:109-151; 
the diethylphosphoramidite method of Beaucag t al., 1981 , Tetrahedron Lett. 22:1859-1862; and th solid support 
method of U.S. Patent No. 4,458,066. A review of synthesis methods is provided in Goodchild, 1990, Biocon jugate 

5 Chemistry 1(3):1 65-1 87. ) 

[001 7] The term "hybridization", as used herein, refers to the formation of a duplex structure by two single stranded 
nucleic acids due to complementary base pairing. Hybridization can occur between complementary nucleic acid strands 
or between nucleic acid strands that contain minor regions of mismatch. Stability of a nucleic acid duplex is measured 
by the melting temperature, or "T m ." The T m is the temperature (under defined ionic strength and pH) at which 50% of 

10 the base pairs have dissociated. Those skilled in the art of nucleic acid technology can determine duplex stability 
empirically considering a number of variables including, for example, the length of the oligonucleotide; base composition 
and sequence of the oligonucleotide, ionic strength, and incidence of mismatched base pairs. * - 
[0018] Conditions under which only fully complementary nucleic acid strands will hybridize are referred to as "strin- 
gent hybridization conditions". Stringent hybridization conditions are well known in the art (see, e.g., Sam brook et al., 

*5 1985, Molecular Cloning - A Laboratory Manual, Cold Spring Harbor Laboratory, Cold Spring Harbor, New York. Gen- 
erally, stringent conditions are selected to be about 5°C lower than the T m for the specific sequence at a defined ionic 
strength and pH. Typically, stringent conditions will be those in which the salt concentration is at least about 0.2 molar 
at pH 7 and the temperature is at least about 60°C. Relaxing the stringency of the hybridizing conditions will allow 
sequence mismatches to be tolerated; the degree of mismatch tolerated can be controlled by suitable adjustment of 

20 the hybridization conditions. - ! : : 

[001 9] Two single-stranded nucleic acids that are complementary except for minor regions of mismatch are referred 
to as "substantially complementary". Stable duplexes of substantially complementary sequences^can be achieved 
under less stringent hybridization conditions. Those skilled in the art of nucleic acid technology can determine duplex 
stability empirically considering a number of variables including, for example, the length and base pair concentration 

25 of the oligonucleotides, ionic strength, and incidence of mismatched base pairs. 

[0020] The term "primer", as used herein, refers to an oligonucleotide, whether natural or synthetic, capable of acting 
as a point of initiation of DNA synthesis under conditions in which synthesis of a primer extension product complemen- 
tary to a nucleic acid strand is induced, i.e., in the presence of four different nucleoside triphosphates and an agent 
for polymerization (i.e., DNA polymerase or reverse transcriptase) in an appropriate buffer and at a suitable tempera- 

30 ture. A primer is preferably a single-stranded oligodeoxyribonucleotide. The appropriate length of a primer depends 
on the intended use of the primer but typically ranges from 15 to 35 nucleotides. Short primer molecules generally 
require cooler temperatures to form sufficiently stable hybrid complexes with the template. '■ 
[0021] A primer need not reflect the exact sequence of the template but must be sufficiently complementary to hy- 
bridize with a template. Primers can incorporate additional features which allow for the detection or immobilization of 

35 the primer but do not alter the basic property of the primer, that of acting as a point of initiation of DNA synthesis. For 
example, non-complementary sequences can be located at the ends of the primer to provide restriction enzyme cleav- 
age sites useful in the cloning of an amplified sequence. 

[0022] The terms "upstream" and "downstream", as used herein, referto the location of the primer binding sites along 
the target sequence. The upstream primer hybridizes to the non-coding strand of the target sequence, and therefore 

40 forms the 5* end of the amplified sequence which is a subsequence of the coding strand of the target sequence. 
Similarly, the downstream primer hybridizes to the coding strand of the target sequence, and therefore forms the 5' 
end of the amplified sequence which is a subsequence of the non-coding strand of the target sequence. 
[0023] The terms "target sequence" and "target nucleic acid sequence", as used herein, refer to a region of the 
oligonucleotide which is to be amplified, detected, or both. The target sequence resides between the two primer se- 

45 quences used for amplification. 

[0024] The term "thermostable nucleic acid polymerase", as used herein, refers to an enzyme which is relatively 
stable to heat when compared, for example, to nucleotide polymerases from E. coff, and which catalyzes the polym- 
erization of nucleoside triphosphates. Generally, the enzyme will initiate synthesis at the 3-end of the primer annealed 
to the target sequence, and will proceed in the S'-direction along the template until synthesis terminates. 

so [0025] The methods of the present invention use specific combinations of a DNA polymerase from Thermus ther- 
mophtlus (Tth) with a DNA polymerase from either Thermotoga maritima (Tma), Pyrococcus species GB-D, or Ther- 
mococcus Utoralis (Tli). . 
[0026] The terms B 3*-to-5' nuclease activity** and "proofreading activity", as used herein, refer to that activity of a 
template-specific nucleic acid polymerase whereby nucleotides are removed from the 3' end of an oligonucleotide in 

55 a sequential manner. < 

[0027] A unit (U) of polymerase activity is a measure of the amount of enzyme needed to synthesize nucleic acid at 
a given rat . The activity units specified herein are as defined by th respective suppliers of each polymerase, as listed 
below. Because activities may be assayed u nder diff rent specific conditions, activity of one nzyme may not be directly 
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comparable to activity of another enzyme. 

[0028] Recombinant DNA polymerases from Thermus thermophiius (rTth) and Thermatoga maritima (UlTma) are 
commercially available from Perkin Elmer, Norwalk, CT. One unit of rTth or UlTma™ DNA polymerase is defined by 
the commercial supplier, Perkin Elmer, as the amount of enzyme that will incorporate 10 nmoles of dNTP into acid 
s Insoluble material at 74° C in 30 minutes, as measured in a 1 0 minute incubation in a 50 \i\ reaction consisting of the 
following: 

200 u.M each dATP, dGTP, dTTP 
100 \iM [ot-^PJ-dCTP (0.05 to 0.1 Ci/mmole) 
io activated salmon sperm DNA 

100mMKCI 
2.2 mM MgCl2 

25 mM TAPS [tris-(hydroxymethyl)-methyl-amino-propanesulfonic acid, sodium salt], pH 9.3 at 25°C 

1 mM beta-mercaptoethanol 

15 

[0029] Recombinant DNA polymerases from Thermococcus litoralis (Vent^® ) and Pyrococcus species GB-D (Deep 
VentR® ) are commercially available from New England Biolabs, Beverly, MA. One unit of Ventp,® or Deep Ventp® 
DNA polymerase is defined by the commercial supplier, New England Biolabs, as the amount of enzyme that will 
incorporate 1 0 nmoles of dNTP into acid insoluble material at 75°C in 30 minutes in a reaction consisting of following: 

20 

200 u,M each dNTP (dATP, dCTP, dGTP, and 3 H-dTTP) 
0.2 mg/ml activated DNA 
10mMKCI 
10mM(NH 4 ) 2 SO 4 
25 20 mM Tris-HCI, pH 8.8 at 25°C 

2 mM MgS0 4 
0. 1% Triton X-1 00 

[0030] Conventional techniques of molecular biology, microbiology and recombinant DNA techniques, which are 
30 within the skill of the art, are explained fully in the literature. See, e.g., Sambrook, Fritsch and Maniatis, Molecular 
Cloning; A Laboratory Manual, Second Edition (1989); Oligonucleotide Synthesis (M.J. Gait, ed., 1984); Nucleic Acid 
Hybridization (B.D. Hames & S.J. Miggins, eds., 1984); A Practical Guide to Molecular Cloning (B. Perbal, 1984); and 
a series, Methods In Enzymology (Academic Press, Inc.). 

[0031] The present invention provides improved methods and reagents for the PCR amplification of long DNA targets. 

35 The PCR amplification process for the amplification of short nucleic acid sequences is well known in the art and de- 
scribed in U.S. Patent Nos. 4,683,1 95; 4,683,202; and 4,965, 188. Commercial vendors, such as Perkin Elmer, Norwalk, 
CT, market PCR reagents and publish PCR protocols. For ease of understanding the advantages provided by the 
present invention, a summary of PCR is provided. - - 

[0032] In each cycle of a PCR amplification, a double-stranded target sequence is denatured, primers are annealed 

40 to each strand of the denatured target, and the primers are extended by the action of a DNA polymerase. The process 
is repeated typically between 25 and 40 times. The two primers anneal to opposite ends of the target nucleic acid 
sequence and in orientations such that the extension product of each primer is a complementary copy of the target 
sequence and, when separated from its complement, can hybridize to the other primer. Each cycle, if it were 100% 
efficient, would result in a doubling of the number of target sequences present. 

45 [0033] In order to achieve efficient PCR amplification of long targets, several requirements must be met. First, target 
sequences must be completely denatured. Longer targets are Increasingly likely to contain GC-rich stretches that are 
prone to incomplete denatu ration because of their relatively high melting temperatures. Incomplete strand separation 
permits rapid renaturation of the target DNA, possibly precluding the annealing and extension of PCR primers. Second, 
extension times must be sufficiently long to allow the completion of strand synthesis in each PCR cycle. Third, long 

so targets must be protected against degradation during amplification. Long targets are more susceptible to degradation 
and strand breakage under PCR conditions. Initial template integrity and subsequent strand survival during PCR are 
therefore important considerations. The methods of the present invention are designed to meet these requirements 
for long PCR without compromising either polymerase activity or the specificity necessary for single-copy gene ampli- 
fications from genomic DNA. 

55 [0034] Improving target strand separation, lengthening the extension times, an protecting the template DNA from 
degradation during thermal cycling greatly increase the maximum amp I if i able target length, but are insufficient to 
achieve efficient amplification of targets. Th fidelity of nucleic acid synthesis is a limiting factor In achieving amplifi- 
cation of long target molecules in the 23-42 kb range. 
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[0035] The mis in corporation of nucleotid s during th synthesis of primer xtension products limits th length of 
target that can be efficiently amplified. The ffect on prim r xtension of a 3'-terminal base that is mismatched with the 
template is described in Huang et al., 1 992, Nucl. Acids Res. 20: 4567-4573. The presence of misincorporated nucle- 
otides may result in prematurely terminated strand synthesis, reducing the number of template strands for future rounds 
s of amplification, and thus reducing the efficiency of long target amplification. Even low levels of nucleotide misincor- 
poration may become critical for sequences longer than 1 0 kb. 

[0036] The fidelity of DN A synthesis is improved if a small amount of thermostable 3'-to-5' exonuclease, or "proof- 
reading", activity is present in the reaction in addition to the DNA polymerase activity. The proofreading activity appar- 
ently improves the yields of long products by removing misincorporated nucleotides and permitting complete strand 
10 synthesis by the predominant polymerase activity. An important aspect of the present invention refers to specific mix- 
tures of thermostable DNA polymerases that greatly Increase the maximum target length amplifiable by providing both 
3'-to-5' exonuclease activity and polymerase activity. 

[0037] Proofreading exonuclease activity is not found in Tth DNA polymerase (Myers and G elf and, 1 991 , Biochem- 
istry 30: 7661-7666), but is inherent in the DNA polymerases from Thermococcus litomfis (the NEB Transcript (1 991 ) 
is vol. 3, no. 1, XP000391405), Pyrococcus species GB-D, and Thermatoga maritima. However, amplification 'of long 
targets with Vent R <B> DNA polymerases alone is less efficient than with Tth DNA polymerase which does not exhibit 3'- 
to-5' exonuclease activity. The decreased amplification efficiency is probably due, at least in part, to primer degradation 
and a decrease in net processivity resulting form the competition between the S'-to-S* exonuclease and polymerase 
activities. 

20 [0038] The relative amounts of 3'-to-5' exonuclease activity and polymerase activity can be controlled by mixing DNA 
polymerases. By combining a small amount of a secondary polymerase which has proofreading activity, such as 77/ 
DNA polymerase, with an active primary polymerase, such as Tth DNA polymerase, the advantage of a proofreading 
activity can be combined with the active DNA polymerase activity inherent in the primary polymerase. 
[0039] Nearly all aspects of PCR protocols affect the amplification efficiency of long target molecules. Extension 

25 times, co-solvents, and polymerases (with and without 3'-to-5'-exo nuclease activity) are the most critical parameters, 
but the pH and composition of the reaction buffer, salts (K + and Mg 2 *), and primer design are also important variables 
for the success of amplifications of long targets. The effects of the individual components of a PCR amplification oh 
the amplification efficiency of long targets are discussed below. 

30 Temperature Cycling j. 

[0040] The amplification reactions exemplified herein use a two-step temperature cycle in which the reaction tem- 
perature alternates between a high temperature at which the target nucleic acid is denatured, and a lower temperature 
at which the primers anneal to the denatured target sequences and primer extension occurs. The time and temperature 

35 of each step in each cycle effects the efficiency of amplification. 

[0041] More complete target denaturation can be achieved by raising the denaturation temperature. However^ raising 
the denaturation temperature may cause higher rates of damage, such as depurination, which decreases the amplifi- . 
cation efficiency, as well as increases loss of polymerase activity. Although it is important to achieve complete dena- 
turation of the target nucleic acid, the rate of target damage must be simultaneously minimized. Consequently, moderate 

40 denaturation temperatures (e.g., about 94°C, depending on GC content) are preferred, with the completeness of de- 
naturation improved by the addition of co-solvents, as described below. " 
[0042] A relatively high annealing temperature (e.g., about 68°C) reduces the hybridization of primers to partially 
homologous target sites, thereby minimizing the synthesis of products from secondary priming sites. In amplifications 
using lambda DNA target as described in the Examples, a minimum of 5-6 minutes at 68°C is needed. The addition 

45 of a more stringent 70-75°C annealing step does not significantly improve yields. Similarly, more complex temperature 
profiles with temperature spikes to accommodate potentially problematic GC- or AT-rich stretches are not significantly 
beneficial. 

[0043] An extension time that permits the completion of strand synthesis is critical for achieving amplification of long 
targets. For the amplification of targets longer than 20 kb, an annealing and extension time of at least 12 minutes, but 

so no more than 22 minutes in any cycle, is preferred. Minimum extension times are dependent upon other factors, such 
as co-solvent levels, as discussed below. Amplification reactions in which the initial extension time used is about 12 
minutes and the extension time is increased 15-20 seconds per cycle yield less non-specific product formation than 
reactions in which an extension time of more than 15 minutes is used throughout the amplification. The autoextension 
feature of the thermal cycler marketed by Perkin Elmer, Norwalk, CT, provides a convenient way to increase the ex- 

55 tension times during an amplification reaction. 
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Reducing Amplification of Non-specific Targets 

[0044] Typically, PCR reagents are combined at room temperature before the initial denaturation step. Th low, less 
stringent temperature can result in the binding of primers either to other primers or to partially-homologous target 

5 sequences. Extension products can be formed from this non-specific primer binding which can lead to short products 
that serve as extremely efficient target competitors, thereby reducing the efficiency of amplification of the desired long 
product. A "hot-starf method minimizes the synthesis of primer extension product from non-specific primer hybridiza- 
tions by inhibiting extension reactions until the reaction temperature is increased enough to prevent such non-specific 
binding. Since genomic templates are likely to contain sequences of partial homology to the target primer sequences, 

10 a hot-start protocol is important to maximize efficiency of long target amplification. 

[0045] One method of achieving a hot-start involves withholding an essential PCR reaction component until the 
temperature of the amplification mixture has been raised to 75-80°C. Examples include withholding either the DNA 
polymerase or Mg 2+ , which is an essential catalyst for DNA polymerase activity. In one hot-start protocol, the essential 
component is added by hand after the denaturation temperature has been reached. Alternatively, the essential reaction 

is component can be withheld by separating reaction components within a reaction tube using a' heat-labile barrier, such 
as a wax that melts at the reaction temperatures. This minimizes the number of times the reaction tube must be opened, 
thereby decreasing the possibility of contamination. 

[0046] Another hot-start protocol which may be useful in the methods of the present invention utilizes uracil-N-gly- 
cosylase to degrade any non-specific product formed before the amplification mixture temperature is raised (see PCT 
20 Patent Publication No. WO 92/01814). : . 

PCR Reagents 

[0047] In a PCR, the primer extension reaction occurs when the primer-template mixture is incubated with a DNA 
25 polymerase under suitable polymerization conditions. These conditions are provided by a reaction mixture containing 
a divalent cation, a monovalent cation, all four deoxyribonucleotide triphosphates (dNTPs), and a buffering agent Co- 
solvents may be added to the reaction mixture which affect the denaturation conditions. Each of these components 
affects the efficiency of the extension reaction and is discussed separately below. * ' ■ - * 

30 DNA Polymerase 1 

[0048] The choice of the combination of thermostable DNA polymerases and their concentrations becomes partic- 
ularly important as the target length or sequence complexity Is increased. The combination of Tth DNA polymerase 
and 77/ DNA polymerase provides the most efficient amplification of long PCR products, and allows amplification of 

35 targets over 40 kb in length. "■_ 

[0049] The optimal amount of DNA polymerase in a PCR amplification depends on a number of factors, including 
the number of copies of target sequences present in the sample. For high-copy reactions 10 7 copies of target), 
higher yields are obtained by using 2-2.5 units (U) Tth DNA polymerase per 50 u.l reaction. Further increases in polymer- 
ase concentration result in an increase in the amplification of non-specific target molecules, resulting in higher back- 

40 ground levels when the amplified products are detected by agarose gel electrophoresis. For low-copy reactions 1 0 4 
copies of target), however, specificity is maximized using about 0.8-1 U Tth DNA polymerase per 50 u.l reaction. For 
intermediate copy numbers of target, maximum yields are achieved using intermediate polymerase concentrations. 
The optimal polymerase concentration is also dependent on the divalent cation concentration. At higher Mg 2+ concen- 
trations, polymerase levels were reduced to minimize accumulation of non-specific products. 

45 [0050] Using PCR with Tth DNA polymerase alone, the maximum target size amplifiablefrom high-copy phage lamb- 
da DNA samples was found to be limited to about 23 kb. Similarly, the maximum target size amplifiable from low-copy 
phage lambda DNA samples was found to be limited to about 1 0-12 kb. Dramatic increases in the size of the amplifiable 
target are achieved by adding a small amount of thermostable 3'-to-5'-exonuclease. 

[0051] As described above, 3'-to-5' exonuclease activity is not found in Tth DNA polymerase. Proofreading activity 
50 is added by combining the Tth DNA polymerase with a small amount of thermostable DNA polymerase that has a 
proofreading activity, such as the DNA polymerases from Thermococcus litoratis, Pyrococcus species GB-D, and Ther- 
motoga maritima. Low concentrations of any of these DNA polymerases are effective in extending the range of target 
sizes amplifiable by PCR using either Tth DNA polymerase; however, a combination of Tth and 77/ DNA polymerases 
has been found to be the most reliable and efficient. 
55 [0052] The optimal concentration ratio is approximately 0.015-0.15 U 77/ DNA polymerase per 2-2.5 U Tth DNA 
polymerase for amplifications from high-copy samples (> 10 7 copies of target in a 50 uJ reaction). For amplifications 
from low-copy samples (£ 10 4 copies of targ t in a 50 pJ reaction), the optimal concentration ratio is approximately 
0.015-0.15 U 77/ DNA polymerase per 0.8-1 U Tth DNA polymerase. Higher concentrations of 77/ DNA polymerase 



6 



EP 0 669 401 B1 



reduce yield, possibly due to primer degradation. 
Co-solvents 

5 [0053] A co-solvent, such as glycerol, is a critical reaction component for the efficient amplification of long targets. 
A number of co-solvents have been reported to facilitate PCR, including glycerol, dimethylsu If oxide (DM SO), polyeth- 
ylene glycol, and formamide. One way in which a co-solvent may influence the efficiency of long-target amplifications 
is by increasing the thermal stability of the DNA polymerase. Increasing the thermal stability slows the loss of DNA 
polymerase activity during the repeated high-temperature denaturation steps. 

10 [0054] Another effect is that a co-solvent may effectively lower the melting and strand separation temperatures, thus 
facilitating the denaturation of the template and increasing the specificity of primer annealing. For example the melting 
temperature can be lowered by 2.5-3°C by the addition of 10% glycerol. Thus, by the addition of a co-solvent, an 
increase in the completeness of target denaturation can be achieved without raising the denaturation temperature, 
which would simultaneously Increase the degradation of target molecules, as discussed above. 

is [0055] A standard Tth PCR buffer typically contains 5% (v/v) glycerol. An increase in the amount of glycerol added 
to an amplification reaction can significantly improve the amplification of long target sequences. Significant increases 
In the yield of a 9.4 kb target result from supplementing a standard Tth PCR buffer with 5% (w/v) glycerol. The per- 
centages described here do not include any glycerol contribution from the various enzyme stocks used. 
[0056] DM SO, preferably in a concentration of about 5-6% (v/v), may also be used alone. However, combinations 

20 of glycerol and DMSO are more effective for longer targets. Preferred concentration combinations include 5-14% (w/ 
v) glycerol with 0.5-5% (v/v) DMSO. For example, amplifications of phage lambda targets 25-34 kb long were enhanced 
by the combination of 1 -3% (v/v) DMSO with 1 0% glycerol, or by using 5% of both co-solvents; amplifications of phage 
lambda targets 35-42 kb long were most enhanced by the combination of 8-9% glycerol with 5% DMSO. Furthermore, 
with a combination of 3% DMSO and 10% glycerol, targets of up to 34 kb were readily amplified with a 10-minute 

25 extension time; with a combination of 1 % DMSO and 1 0% glycerol, amplification was limited to 26 kb targets. A preferred 
combination consists of 1 0% glycerol and 2.25% DMSO. 

[0057] DMSO, unlike glycerol, reduces the thermal stability of the polymerase. However, the effective lowering of 
melting and strand separation temperatures by 5.5-6°C per 10% DMSO may be the dominant effect in long PCR. The 
addition of DMSO may also increase the DNA stability by decreasing the rates of depurination and/or chain scission 
30 and may accelerate strand renaturation. The reduction of melting and strand separation temperatures by combinations 
of glycerol and DMSO is generally consistent with a total reduction estimated by adding the effects of each component 
alone. The enhancement of yields resulting from the effective lowering of the melting and strand separation tempera- 
tures by the addition of a co-solvent, as discussed above, is not readily duplicated by raising the denaturation or 
annealing temperature during PCR. ■ ' ' " 

35 

Buffers 

[0058] The pH of an amplification mixture affects the stability of the template DNA. Increasing the pH of the reaction 
can decrease the degradation of template DNA during thermal cycling. Although PCR amplification mixtures are pH 
40 buffered, the pH of a typical PCR reaction varies considerably during the temperature cycling because of the temper- 
ature dependence of the reaction buffer. The buffering agent used in a typical PCR is Tris, which has a ApKa of -0.031 
per °C. The fluctuation in pH during the temperature cycling can be decreased by using a buffering agent with a smaller 
ApKa. 

[0059] Two suitable buffers are Tris(hydroxymethyl)methylglycine (tricine), which has a ApKa of -0.021 per °C, and 
45 N,N-Bis(hydroxyethyl)glycine (bicine), which has a ApKa of -0.01 8 per °C; both values measured at 20°C and 0.1 M 
ionic strength (see Good and Izawa, 1 972, Meth. Enzymol. 24, Part B:53-68). With either a tricine or bicine buffer, the 
pH remains higher during the high temperature reaction conditions than with the typical Tris buffer, and the fluctuations 
in pH caused from the temperature cycling are decreased. 

[0060] Optimal buffers and pH are dependent on, among other things, the DNA polymerase used. Using Tth DNA 
so polymerase, a buffer consisting of 10-35 mM, preferably 20-25 mM, tricine at pH 8.5-8.7 (25°C) provides the most 
reliable results. Optimal buffer conditions may need to be determined empirically for the amplification of specific targets. 

Divalent Cation 

55 [0061 ] The preferred divalent cation for the amplification of DNA is Mg 2 *. In the absence of added 3'-to-5'-exonucle- 
ase activity, long PCR is enhanced at total Mg 2 * levels of 1 .7-2 mM. In the presence of proofreading activity, however, 
the highest yields ar obtained with 0.9-1 .3 mM total Mg 2 *. Increased yields of some targets can be achieved by 
increasing the Mg 2+ concentration up to 1 .5 mM while reducing th total enzyme concentration, particularly the primary 
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polymerase levels (to 1 .25-2 U Tth DNA polym rase). However, for some targets, reducing total enzyme levels in order 
to reduce the synthesis of non-specific products at high r Mg 2+ levels also r duces product yields. As with K+ levels 
described below, the Mg 2+ optimum for each system may need to be determined empirically. 

5 Monovalent Cation 

[0062] The preferred monovalent cation is K+, supplied as KOAc (K-acetate) or KCI. For the amplification of long 
target molecules, reduced K + levels are beneficial. A decrease in non-specific background can be achieved if the K + 
is supplied as KOAc rather than KCI. In general, K+ concentrations reduced by 10-40% are more favorable to long 
10 PCR than the standard levels (100 mM KCI for use with Tth DNA polymerase). Preferred concentrations for use with 
Tth DNA polymerase are 60-1 00 mM KOAc, preferably 80-85 mM KOAc. Optimal concentration ranges may be system- 
dependent. 

[0063] The efficiency of PCR amplifications using tricine or bicine buffers is similar using either KCI or KOAc as the 
monovalent cation. However, improved reaction robustness is realized using a tricine/KOAc buffer. A trie! ne/KOAc 
is buffer has a slightly lower ionic strength than a tricine/KCI buffer, which could help destabilize secondary structures in 
a template with a high G+C content, thereby improving the completeness of target denaturation. 
[0064] Although KCI and KOAc are the preferred monovalent salts, other monovalent salts may be useful in the 
methods of the present invention. These include NaCI, (NH 4 ) 2 S0 4 , K-glutamate, and NH 4 -acetate. 

20 Primers 

[0065] Primer concentrations may need to be optimized for each system and approximate starting template copy 
number. For example, for the phage lambda amplification reactions described in the Examples, below, a higher con- 
centration of primer was optimal for amplifying samples containing a high copy number of target than was optimal for 

25 amplifying samples containing a low copy number of target. For the high-copy reactions (£ 10 7 copies of target), the 
optimum primer concentration was 0.4-0.5 u,M of each primer. For low-copy amplifications (^ 10 4 copies of target), 
0. 1 5-0.2 u,M of each primer was most effective in the absence of proof reading activity, and 0.2 ^lM of each primer was 
best if 3'-to-5'-exonucleolytic activity was present. For intermediate copy-number reactions, increasing the primer con- 
centration above 0.2 jiM was as least as effective as increasing DNA polymerase levels, as discussed above, in en- 

30 hancing yields. The improved PCR protocols that enable the amplification of target nucleic acid sequences up to 42 
kb in length are summarized in Table 1 , below. . 3 = 

Table 1 

Optimal Long PCR Conditions 

35 Temperature profile 

25 to 40 amplification cycles (template copy number dependent) 
Two-temperature cycling: 

(a) Short denaturation step, (e.g. 94°C for 10-15 seconds) 

(b) Long annealing/extension step, (e.g. 68°C for 10-1 4 minutes initially, increased by 15-20 seconds per cycle 

40 

for at least 5-8 cycles) 

Final hold at 72°C for at least 1 0 minutes 

Hot-start 

45 Separate reagent (Mg 2+ , enzyme, or dNTPs) until all samples have reached 75-80°C, preferably using a wax barrier. 
Primary polymerase 

2,5 units Tth DNA polymerase per 50 u,l for high-copy template (£ 1 0 7 copies) 
0.8-1 .0 units Tth DNA polymerase per 50 uJ for low-copy template (£ 1 0 4 copies) 

50 

3'-to-5'-exonuclease (high- or low-copy template) 
0.01 5-0. 1 5 units 77/ DNA polymerase per 50 \l\ 

55 Co-solvent 

5-14% glycerol with 0.5-5% DMSO ' 
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Table 1 (continued) 
Optimal Long PCR Conditions 

Buffer 

20-25 mM tricine or bicine, pH 8.5-8.7 
Divalent cation 

0.9-1 .5 mM Mg 2+ total; 0.2 mM changes can be critical 

10 Monovalent cation 
80-85 mM KOAc 



15 



25 



30 



40 



50 



55 



Primer design 

Either 20-23 bp with 50-60% GC content, or longer sequences, to permit the use of relatively high annealing 
temperatures. 



Primer concentration 

0.4-0.5 u,M for high -copy template 1 0 7 copies) 
20 0.1 5-0.2 p,M for lowKiopy template (s 1 0 4 copies) 

dNTP concentration 

0.2 mM each dATP, dCTP, dGTP, dTTP 



[0066] In general, the nucteic acid in the sample will be DNA, most usually genomic DNA. However, the present 
invention can also be practiced with other nucleic acids, such as RNA or cloned DNA, and the nucleic acid may be 
either single-stranded or double-stranded in the sample and still be suitable for purposes of the present invention: 
Those skilled in the art recognize that whatever the nature of the nucleic acid, the nucleic acid can be amplified using 
appropriate modifications to the present methods. 

[0067] Due to the enormous amplification possible with the PCR process, small levels of DNA carry-over from sam- 
ples with high DNA levels, from positive control templates, or from previous amplifications can result in PCR product, 
even in the absence of purposefully added template DNA. If possible, all reaction mixes are set up in an area separate 
from PCR product analysis and sample preparation. The use of dedicated or disposable vessels, solutions, and pipettes 
(preferably positive displacement pipettes) for RNA/DNA preparation, reaction mixing, and sample analysts will mini- 
mize cross contamination. See also Higuchi and Kwok, 1989, Nature 339 :237-238 and Kwok, and Orrego, in Innis et 
al. eds., 1990 PCR Protocols: A Guide to Methods and Applications, Academic Press, Inc., San Diego, CA. 
[0068] Enzymatic methods to reduce the problem of contamination of a PCR by the amplified nucleic acid from 
previous reactions are described in PCT Patent Publication No. WO 92/01814 and U.S. Patent No. 5,035,996. The 
methods allow the enzymatic degradation of any amplified DNA from previous reactions. PCR amplifications are carried 
out in the presence of dUTP instead of dTTP. The resulting double-stranded amplification product which incorporates 
uracil is subject to degradation by uracil-N-glycosylase (UNG), whereas normal thymine-containing DNA is not degrad- 
ed by UNG. Amplification reaction mixtures are treated with UNG before amplification to degrade all uracil containing 
DNA that could serve as target Because the only source of uracil-containlng DNA is the amplified product of a previous 
reaction, this method effectively eliminates the problem of contamination from previous reactions (carry-over). UNG is 
rendered temporarily inactive by heat, so the denaturation steps in the amplification procedure also serve to inactivate 
the UNG. New amplification products, therefore, though incorporating uracil, are formed in an UNG-inactivated envi- 
ronment and are not degraded. 

[0069] Analysis of the amplified products may be achieved by a variety of means depending on the information 
desired. The nucleotide sequence of amplified products can be obtained using standard techniques, such as the pro- 
tocol described by Innis et al., 1988, Proc. Natl. Acad. ScL 85:9436-9440. The PCR amplification products can be 
sequenced directly (see Saiki et al., 1988, Science 239 :487-491 ) or indirectly by first cloning the products and replicating 
them in an appropriate host cell. 

[0070] Amplified nucleic acid sequences can be detected and purified by methods well known in the art (see Sam- 
brook, et al., 1 989, supra). Methods which separate molecules according to size, such as gel electrophoresis, can be 
used to purify the amplified nucleic acid. In particular, agarose and/or acrylamide gel electrophoresis are preferred 
means for analyzing amplified products (see Scharf et al., 1 986, Science 233 :1076-1 078). For greater size resolution, 
either field inversion gel electrophoresis or low-percent (0.3%) agarose gel electrophoresis may be used, as described 
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in the Examples. 

[0071] Amplified products can be detected by direct visualization of the electrophoretically siz fractionated product 
by, for example, staining with ethidium bromide. Alternatively, amplified products can be detected using oligonucleotide 
hybridization probes which are complementary to the target sequence. Under appropriate hybridization conditions, 

5 probes hybridize only to target nucleic acid sequences, The presence of hybrid duplexes, which can then be detected 
by various means, indicates the presence of amplified product To facilitate the detection of hybrid duplexes formed 
between probes and target nucleic acid sequences, either the primers or the probes may be bound to additional mol- 
ecules, such a detectable label or a molecule that enables the immobilization of the primer or probe. Labels incorporated 
into the probes to allow detection or immobilization should not affect the hybridization properties of the probes. 

10 [0072] Probes can be labeled by incorporating a label detectable by spectroscopic, photochemical, biochemical, 
immunochemical, or chemical means. Useful labels Include 32 P, fluorescent dyes, electron-dense reagents, enzymes 
(as commonly used in ELISAs), blotin, or haptens and proteins for which antisera or monoclonal antibodies are avail- 
able. Probes also can be bound to an additional compounds that are used to immobilize the probe on a solid support. 
[0073] Labeled probes can be synthesized and labeled using the techniques described above for synthesizing oli- 

15 gonucleotides. For example, the probe may be labeled at the S'-end with 32 P by incubating the probe with ^P-ATP 
and kinase. A suitable non-radioactive label for SSO probes is horseradish peroxidase (HRP). Methods for preparing 
and detecting probes containing this label are described in U.S. Patent Nos. 4,914,210, and 4,962,029. The use of 
such labeled probes is also described in U.S. Patent No. 4,789,630; Saiki et al., 1988, N. Eng. J. Med. 319 :537-541; 
and Bugawan et al., 1988, Bio/Technology 6:943-947. Useful chromogens for the detection of HRP labeled probes 

20 include red leuco dye and 3,3' ,5, 5-tetramethy (benzidine (TMB). 

[0074] Examples of additional compounds incorporated into probes to allow immobilization of the probes include a 
long poly-dT 'tail" that can be fixed to a nylon support by irradiation, a technique described in more detail in PCT Patent 
Publication No. WO 89/11548. 

[0075] Suitable assay methods for detecting hybrids formed between probes and target nucleic acid sequences in 
25 a sample are known in the art (Sambrook et al., 1 985, supra). Examples include the dot blot and reverse dot blot assay 
formats. 

[0076] In a dot blot format, unlabeled amplified target DNA is immobilized on a solid support, such as a nylon mem- 
brane. The membrane-target complex is incubated with labeled probe under suitable hybridization conditions, unhy- 
bridized probe is removed by washing under suitably stringent conditions, and the membrane is monitored for the 

30 presence of bound probe. 

[0077] An alternate format is a "reverse" dot blot format, in which the amplified target DNA is labeled and the probes 
are immobilized on a solid support, such as a nylon membrane. The target DNA is typically labeled during amplification 
by the incorporation of labeled primers. The membrane-probe complex is incubated with the labeled sample under 
suitable hybridization conditions, unhybridized sample is removed by washing under suitably stringent conditions, and 

35 the filter is then monitored for the presence of bound target DNA. r ' 

[0078] Alternatively, the reverse dot blot assay may be carried out using a solid support having a plurality of probe 
hybridization sites or wells. For example, a microwell plate is particularly useful in large scale clinical applications of 
the present methods. A reverse dot blot assay utilizing a microwell plate is described by Loeffelholz etal., 1992, In J. 
Clin. Microbiol. 30 (11): 2847-2851. Probes can be immobilized to a microwell plate either by passive binding or by 

^0 first binding the probes to bovine serum albumin (BSA), which adheres to microwell plates. 

[0079] Another suitable assay method system is described in U.S. Patent No. 5,210,015, in which a labeled probe 
is added during the PCR amplification process. The probes are modified so as to prevent the probe from acting as a 
primer for DNA synthesis. Any probe which hybridizes to target DNA during each synthesis step is degraded by the 
5'-to-3* exonuclease activity of the DNA polymerase. The degradation product from the probe is then detected. Thus, 

45 the presence of probe breakdown product indicates that hybridization between probe and target DNA occurred. 

[0080] The present invention also relates to kits, mufticontainer units comprising useful components for practicing 
the present method. A kit will contain a combination of preferred polymerase enzymes in the concentration ratios 
described herein. Additional components which may be contained in a useful kit include primers for PCR amplification 
and reagents for carrying out the PCR methods of the present invention. '< 

so [0081] The ability to amplify sequences of 1 0-40 kb has a number of applications in areas such as genome mapping, 
sequencing, and genetics. Small gaps in the genome maps that currently appear resistant to molecular cloning may 
be accessible by amplification of a sequence between known flanking sequences. The amplification of longer targets 
would also allow greater flexibility in choosing primers to avoid problematic sequences, such as that seen in the beta- 
globin gene system described below. Longer templates promise to speed the process of genomic sequencing as well, 

55 by increasing the distance covered with each sequencing step. From known expressed sequences, amplifications can 
be carried out spanning longer introns, and more complete genes sequences can be amplified at one time. Long PCR 
therefore complements technologies for rapid, long-range sequencing. PCR-based characterization and diagnosis of 
both homozygotes and heterozygote carriers of a number of medically important insertions and deletions of great r 
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than 4 kb would also b possible. 

[0082] The results presented her specifically demonstrate the potential application of thes protocols to th char- 
acterization of cloned sequences. The J and cro gene primers, CF1018 (SEQ ID NO: 23) and CF1019 (SEQ ID NO: 
24), described below should be useful for nearly all inserts cloned with lambda-based vectors, for amplifications from 
5 both plaques and isolated DNA. The PCR products are readily analyzed by restriction digests and should be suitable 
for sequencing. Cosmid inserts may also be amplifiable from colonies. Long PCR will facilitate molecular cloning by 
amplifying low-copy insert material, and facilitate assembly of larger recombinant constructions in PCR-based muta- 
genesis. 

[0083] The examples of the present invention presented below are provided only for illustrative purposes and not to 
10 limit the scope of the invention. 

Example 1 

Materials and Methods 

15 . 

[0084] Preferred protocols and reagents for the PCR amplification of long phage lambda and human beta-globin 
gene cluster sequences are described below. The results of amplifications using the following methods are described 
in the subsequent examples. 

20 Target Nucleic Acid Sequences 

[0085] Two template nucleic acid sequences were used for the design of the amplification primers described below, 
the sequence of the phage lambda genome (Gen Bank accession number M 17233) and the sequence of the human 
beta-globin gene cluster (Gen Bank accession number J00179). Phage lambda and human DNA were used in the 

25 amplifications described below. 

[0086] Lambda DNA (1 ng/u.l) was obtained from Perkin Elmer, Norwalk, CT. Aliquots (-1 00 ng) of lambda DNA were 
thawed once, then stored at 4°C. Total. genomic DNA from human placenta was obtained from Sigma Chemicals, St. 
Louis, MO. All template DNA dilutions were made with 10 mM Tris»CI (pH 8 at 25°C), 0.1 mM EDTA. 
[0087] A library of human genomic clones in lambda FIX II was obtained from Stratagene, La Jolla, CA, and grown 

30 as recommended by the manufacturer, on Luria broth agar plates with top agarose. Randomly selected plaques were 
removed using siliconized Pasteur pipettes, and placed in 30 u.l of 25 mM Tris*CI (pH 8.3), 10 mM MgCI 2 and stored 
at 4°C. Aliquots of 1 u.l were used for PCR. 

[0088] Total genomic DNA from a tymphoblastoid cell line (KAS011 B) was isolated using 0.1 mg/ml proteinase K- 
and 0.5% SDS in 1 0 mM Tris^CI (pH 8), 150 mM NaCI, and 10 mM EDTA, overnight at 50°C. Following extraction with 
35 Tris-satu rated phenol (pH 8),. and ethanol-precipitation with NaOAc, the sample was treated with RNase A, then ex-~ 
tracted with phenol-chloroform, and dialyzed against 10 mM TrisOl (pH 8), 1 mM EDTA. 

Primers 

40 [0089] A set of primers was designed to enable the PCR amplification of lambda genomic target sequences ranging 
in size from 1 .5 to 42.2 kilobases in length. Upstream primers were designed to be used with each of the downstream 
primers, resulting in a series of target sequences increasing in length by 1 to 3 kilobases. 

[0090] Each primer of the set was designed so as to have approximately the same optimal annealing temperature 

(~68°C) by selecting primer sequences between 20 and 23 base pairs in length such that the hybrid duplex formed 
45 between the primer and target sequence would have an overall composition of 12 G-C pairings and 8-11 A-T pairings. 

Optimal annealing temperatures were estimated using the Tp" algorithm of Wu et al., 1 991 , DNA Cell Biol. 1 0:233-238. 

[0091] An additional pair of primers, the J and cro gene primers, were designed to enable amplification of nearly all 

inserts cloned with lambda-based vectors, from either plaques or isolated DNA, is shown in Table 2, below. 

[0092] Similarly, primers were designed for the amplification of regions of the human beta-globin gene cluster. The 
so primers were designed such that a fixed downstream primer could be used with a series of upstream primers to amplify 

targets of 7.5-22 kb. The primers amplify a target region extending upstream across the delta-globin gene and into the 

second intron of the A-gamma globin gene. 

[0093] The nucleotide sequences of the primers used in the following examples are shown (5'-to-3') in Table 2, below. 
Melting temperatures (T m ) were calculated essentially as described in Wetmur, 1991, Crit. Rev. Biochem. Mol. Biol. 
55 26:227-259. Melting temperature calculations were carried out assuming 2 dangling ends, 3.5 u.g/ml (-0.5 jiM) primer, 
80 mM Na + , and 1 .5 mM Mg 2+ . Calculated melting temperatures ranged from 63-70°C. The addition of 10% glycerol 
decreases the T m by 2.5°C. Primer nucleotide s quences were valuated for potential secondary priming sites within 
the template DNA sequences and for inter- and intra-primer sequence complementation using th Oligo 4.0 softwar 
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Table 2 
Amplification Primers 
Primer Seq ID No. Sequgrire Position 3ml!£l 

primers for phage lambda (GenBank accession no. M17233): 



CF1001 


1 


GGTGCTTTATGACTCTGCCGC 


304-324 


67 


scion 


2 


GCTGAAGTGGTGGAAACCGC 


506-525 


67 


CF1005 


3 


GCTCTTTCCGCTCTGCCATC 


* 1841-1860 


66 


CF1007 


4 


CGGCACTGGCAAGCAACTGA 


* 4921-4940 


67 


CF1008 


5 


CCTCAACCGGATCGAAGGCT 


* 6569-6588 


67 


CF1010 


6 


AGCGTGACGGTCACACCGTT 


* 9741-9760 


70 


SC1012 


7 


GACTCTGGCCATCTGCTCGT 


* 10600-10619 


65 


CF1012 


8 


GGACCTATCTGCCCGTTCGT 


* 12981-13000 


67 


CF1013 


9 


GCCACCAGTCATCCTCACGA 


* 14551-14570 


65 


SC1000 


10 


GCAGCGTGATTTCACGGTCG 


* 17025-17044 


69 


SC1001 


11 


GCTCACATAACGTCCACGCAG 


* 19259-19279 


67 


SC1002 


12 


GCCTCGCATATCAGGAAGCAC 


* 21359-21379 


66 


SC1003 


13 


GGGTGACGATGTGATTTCGCC 


* 23335-23355 


67 


SC1008 


14 


GGCATTCCTACGAGCAGATGGT 


* 26893-26914 


66 


SC1009 


15 


GGTCTGCCTGATGCTCCACT 


* 28536-28555 


64 


SC1016 


16 


GTCGGACTTGTGCAAGTTGCC 


* 30436-30456 


67 


SC1017 


17 


GCATGGATTCTGTCGACCCAC 


* 32741-32761 


65 


SC1018 


18 


GAGAACCACCG AGCCTG ATG 


* 34413-34432 


64 


SC1019 


19 


AGCATTGGCCGTAAGTGCGATT 


* 35454-35475 


69 


SC1021 


20 


GGCCTTGTTG ATCGCG CTTTG A 


•38118-38139 


70 


SC1022 


21 


TGTCACGCCTGCCTGTTGCTT 


* 39505-39525 


68 


SC1024 


22 


GCGTTCCGCACGAGATACATG 


* 42730-42750 


68 


Lambda vector primers, from the J and cro gene sites of phage lambda: 




CF1018 


23 


AGAAACAGGCGCTGGGCATC 


18872-18891 


67 


CF1019 


24 


CGGGAAGGGCTTTACCTCTTC 


* 38197-38217 


66 



Primers for human beta-globin gene cluster (accession No. J00179): 



RH1019 


25 


CTGCTGAAAGAGATGCGGTGG 


54529-54549 


65 


RH1020 


26 


CTGCAGTCCCAGCTATTCAGG 


52152-52172 


63 


RH1022 


27 


CGAGTAAGAGACCATTGTGGCAG 


48528-48550 


65 


RH1024 


28 


TTGAGACGCATGAGACGTGCAG 


44348-44369 


67 


RH1025 


29 


CCTCAGCCTCAGAATTTGGCAC 


42389-42410 


65 


RH1026 


30 


GAGGACTAACTGGGCTGAGACC 


40051-40072 


65 


RH1016 


31 


CAGCTCACTCAGTGTGGCAAAG 


* 62589-^2610 


64 


RH1053 


32 


GCACTGGCTTAGGAGTTGGACT 


* 61986-62007 


65 



Downstream primer complementary to position numbers listed* 



[0094] Primers were synthesized using the cyanoethoxyphosphophoramidite method (1 u,M scale) on a 394 DNA 
Synthesizer (Applied Biosystems, Foster City, CA). The primers were deprotected and cleaved from the resin in 29% 
NhVH 2 0, then desalted with Sephadex G25 (NAP-10 columns from Pharmacia LKB, Piscataway, NJ). The results of 
each synthesis were assessed by polyacrylamide gel electrophoresis. All primer stocks were made with 1 0 mM Tris*CI 
(pH 8 at 25°C), 0.1 mM EDTA. 
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Thermostable DNA Polymerases 

[0095] Recombinant Tth DNA polymerase (rTth) was purchased from Perkin Elmer, Norwalk, CT. The 77/ DNA 
polymerase is described in U.S. Patent No. 5,210,036. The 77/ DNA polymerase (Ventp® ) and DNA polymerase from 
s Pyrococcus species GB-D (Deep Vent^® ) were purchased from New England Biolabs, Beverly, MA. The Tma DNA 
polymerase is described in International Patent Publication No. WO 92/03556, and referred to therein as pTma12-3. 
A modified DNA polymerase from Thermatoga marftima is commercially available from Perkin Elmer, Norwalk, CT 
(UlTma™). 

[0096] Dilutions (1/5 and 1/1 0) of the Vent^® and Deep Vent^® DNA polymerases preferably may be made in storage 
10 buffers as described by each manufacturer. In the Examples below, however, the Ven^® dilution buffers used contained 

I mM EDTA and 0.05% Tween 20 (Sigma Chemicals, St. Louis, MO) instead of 0.1% Triton X-100. This modification 
had no effect on the amplification reactions. Vent R ® polymerase dilutions were made fresh weekly; Deep Ventp® 
polymerase was diluted just before use. The polymerase can also be stored in the rTth DNA polymerase storage buffer 
supplied by the manufacturer (100 mM KCI, 20 mM Tris-HCI, pH 8.0, 0.1 mM EDTA, 1mM DTT, 0.5% Tween® 20; 50% 

is (v/v) glycerol). .' * - " 

Additional Buffer Components \ 

[0097] Standard Tth polymerase buffer (5% (v/v) glycerol, 10 mM Tris»CI (pH 8.3), 100 mM KCI, 0.75 mM EGTA, 
20 0.05% Tween 20) for PCR was obtained from Perkin Eimer, Norwalk, CT. Trictne buffer stocks (Sigma Chemicals; St. 
Louis, MO) at 1 .0 M were adjusted to their final pH (at 25°C) with KOH. Molecular biology grade dimethylsulfoxide 
(DMSO) and glycerol were from Sigma Chemicals, St Louis, MO, and J.T. Baker Chemicals, Phillipsburg, N J, respec- 
tively. Potassium acetate (KOAc) was also obtained from J.T Baker Chemicals. The contribution of glycerol (typically 
£ 1 %) from enzyme storage buffers was not included in the glycerol concentrations given for any PCR buffer described 
25 herein. 

PCR Methods f. 

[0098] All lambda genomic DNA amplifications were performed in a GeneAmp® PCR System 9600 thermal cycler, 
30 using MicroAmp™ tubes with individual caps, all marketed by Perkin Elmer, Norwalk, CT. Reaction volumes were either 
50 or 100 u.l. The concentration of each dNTP was 0.2 mM for all reactions, but other reaction components were varied 
as discussed in the text and listed in Table 1. 

[0099] To minimize the amplification of non-specific sequences and the formation of primer-dimers, manual "hot- 
starts" were performed in which the Mg 2+ was withheld until the samples had been incubated in the thermal cycler at 
35 75-80°C for -90 seconds. The necessary Mg 2 * was then added from a 25 mM stock (at room temperature): Following 
the addition of Mg 2+ , the samples were incubated for an additional 30-60 seconds, for a total of 4-7 minutes at 75-80°C 
prior to the first den atu ration step. The total time includes the time required to add the Mg 2 *, and therefore depends 
upon the total number of tubes. An alternate "hot-start" procedure is described in Example 6. 

[0100] The thermal cycler was programmed to carry out a two-step temperature profile. Each amplification cycle 
40 consisted of denaturation at 94° C for 1 0 seconds followed by annealing and extension at 68°C for 5-20 minutes. A 15 
second denaturation step can also be used. For annealing and extension times longer than 12-14 minutes, the au- 
toextension feature of the thermal cycler was used to add 15-20 seconds per cycle, to a final -16-22 minutes ' Reactions 
were carried out for between 25 and 40 cycles, depending upon the starting target sequence copy number/the target 
length, and the reaction conditions. In most reactions, an initial 10 second incubation step at 94° C and a final 10 minute 
45 incubation step at 72°C were included. 

[0101] The amplifications, described in the following examples, of human genomic inserts cloned into lambda FIX II 
and of regions of the human beta-globin gene cluster were carried out essentially as described above, but with the 
modifications detailed below. Specific conditions for the amplification of human genomic inserts cloned in lambda FIX 

II from plaque suspensions in 100 u,l reaction volumes were as follows. 

50 

25 mM tricine (pH 8.7) 
85 mM KOAc 
1 2% (w/v) glycerol 
0.2 mM each dNTP 
55 o.4 u.M each primer 

1 .75 U Tth polymerase ". - . 

0.02 U 77/ polymerase 
1.15 mM Mg(OAc) 2 



13 



EP 0 669 401 B1 



An 80°C hot-start was used with a two-step th rmal cycling profile, as described above. Th ann aling and extension 
step was initially 12 minutes at 68°C and extended by 15 seconds per cycle for 32 cycles. 

[0102] Specific conditions for the amplification of a region of the human beta-globin gene cluster from 37 ng of 
KAS011 DNA in 50 jil reaction volumes were as follows. 

5 

20 mM tricine (pH 8.7) 
85 mM KOAc 
1 0% (w/v) glycerol 
2% (v/v) DMSO 
10 0.2 mM each dNTP 

0.2 (iM each primer 
0.9 U Tth polymerase 
0.02 U 77/ polymerase 
1.1 mM Mg(OAc) 2 

A 78°C hot-start was used with a two-step thermal cycling profile, as described above. The annealing and extension 
step was initially 12 minutes at 68° C for 12 cycles, then extended 1 5 seconds per cycle for 24 cycles. 
[0103] Increased yields of amplified product may be obtained by the addition of up to 500 u.g/ml of nonacetylated 
. BSA to the amplification reaction. 

20 

Analysis of PCR Products * 

[0104] Typically, 5-8 (il from each PCR amplification were analyzed on standard horizontal gels consisting of 0.6% 
(w/v) SeaKem GTG agarose (FMC BioProducts, Rockland, ME) in 1XTBE (89 mM Tris base, 89 mM boric acid, 1 u,M 
25 to 2 mM EDTA) or 1 X TAE (40 mM Tris-acetate, 2 mM EDTA, pH 8-8.5) with 0.5 u.g/ml ethidium bromide, at about 4-6 
V/cm for 1 .5-2 hours. For greater size resolution, two alternatives were used: field inversion gel electrophoresis and 
0.3% agarose gel electrophoresis. 

[0105] Field inversion gel electrophoresis (FIGE) was performed using a Hoefer system (SuperSub gel apparatus, 
Switchback pulse controller, and power supply, all from Hoefer, San Francisco, CA) with a cooling unit (221 9 Multttemp 
30 || from Pharmacia LKB). Between 3 and 7 u.l from each PCR amplification were analyzed on FIGE gels of 0.95% ■ 
agarose in 0.5x TBE (at 1 u.M EDTA). The FIGE gels were prerun for 15 minutes at 110 V, then run for 22-25 hours at 
140-145 V, with pulse times of 0.65-1 .95 or 0.75-2 seconds (forward:reverse = 2.8:1 or 3:1). Run temperatures were 
estimated at 12-1 5°C. 

[0106] Alternatively, load 2-5 u.l on 0.3% Chromosomal Grade agarose (Bio-Rad, Richmond, CA) or Seakem GTG 
35 or Gold (FMC BioProducts, Rockland, ME) in 1X TAE. Cool the gel to 4°C before removing the comb. Load 5-8 u.l of 
sample and run in 1 X TAE with 0.5% ethidium bromide at 100 V for 2 minutes, then either at 1 .5 V/cm for 6 hours or 
at 0.7 V/cm for 16 hours. ? 
[0107] The size of the amplified products was determined by comparison with molecular weight markers run on each 
gel in addition to the sample. Molecular weight markers used were lambda/H/ndlll from either New England Biolabs 
40 or Gibco BRL, lambda/mono cut mix from New England Biolabs, and 1-kb ladder from Gibco BRL. 

[0108] For restriction analyses, aliquots (10-16 u.l) of PCR amplification product from lambda DNA amplifications 
were digested with Bcl\, BssHU, and Mfu\ (New England Biolabs); or BamH\, EcoR\, and H/ndlll (Gibco BRL), using 
the manufacturer's buffers, prior to electrophoresis. Digestions were carried out for 2.5-3 hours in 30-36 u.l reactions. 
Samples were analyzed using 0.6-8% agarose gels. Aliquots of plaque PCR samples (1 0-30 u.l aliquots) were digested 
45 with Noii (Stratagene) overnight in 40 u.l reactions. 

Example 2 

Amplification of Phage Lambda Genomic Sequences 

so 

[0109] Amplifications were carried out using target sequences from high copy (1 0 7 -1 0 8 copies of target) phage lamb- 
da DNA samples as described in Example 1 , above. Targets of 1 .5 to 42.2 kb were defined within this ~50-kb sequence 
(GenBank M1 7233) by the various pairings of the primers listed in Table 2, above. 

[0110] Amplified product was analyzed by field inversion gel electrophoresis (FIGE) and visualized with ethidium 
55 bromide staining. Total yields (per 50 uJ), as estimated by comparison with a lambda/H/ndll I molecular weight marker, 
were estimated at between 0.7-1 u.g of 22.8 kb product and 0.2-0.3 |xg of 39-kb product. A 42.2 kb target, amplified 
using primers SC1011 (SEQ ID NO: 2) and SC1024 (SEQ ID NO: 22), was amplified with lower yields. 
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Example 3 

Amplification of Lambda Clones From Plaques 

s [01 1 1 ] One important use for the methods of the present invention is the amplification of inserts from lambda clones 
without prior, labor and time intensive DNA isolations. To demonstrate the utility of the present methods to the ampli- 
fication of such inserts, primers CF1 01 8 (SEQ ID NO: 23) andCF1019 (SEQ ID NO: 24) were designed from sequences 
within the J and cro genes of lambda (see Table 2). 

[01 12] Amplifications were carried out as described in Example 1 using randomly selected plaques from the human 
10 genomic library in lambda FIX II described in Example 1 . Amplification products were analyzed by gel electrophoresis 
following digestion with Notl to separate the insert from flanking vector sequences. The presence of both vector frag- 
ments confirms that the entire insert was amplified. 

[01 13] The size of the amplified inserts ranged from less than 1 0 kb to greater than 20 kb. The manufacturer estimates 
that insert sizes of 9-23 kb are accommodated by this lambda vector. Inserts were sized by their mobility relative to 
is molecular weight markers in FIGE gels. 

Example 4 j 

Amplifications of Human Genomic Targets 

20 

[0114] The human beta-globin gene cluster was chosen as a model for genomic targets that are likely to contain 
repetitive sequences and homologous sites elsewhere in the genome. Primers designed for the human beta-globin 
gene cluster are shown in Table 2, above. A fixed downstream primer was paired with a series of upstream primers 
that amplify a region extending upstream across the delta-globin gene and into the second intron of the A-gamma 
25 giobtn gene. Targets of 13.5, 17.7, 19.6, and 22 kb were amplified from 37 ng (-10 4 copies) of total human genomic 
DNA as described in Example 1 . Aliquots of 12.5 u.l of the amplified products were loaded on FIGE gels. A lambda/ 
Hin6\\\ molecular weight marker was used for comparison. 

[01 15] For comparison, targets of 1 6.5, 1 8.8, 20.8, and 22.8 kb were amplified from 0.05 pg. (-1 0 3 copies) or 0.5 pg 
(-1 0 4 copies) phage lambda DNA in a background of -3.7 ng or 37 ng, respectively, human placental genomic DNA, 
30 under the same conditions. By also amplifying from a low input target number a target previously amplified from a high 
input target number, the effects attributable to a decrease in input target copy number can be separated from the effects 
attributable to a difference in target sequence. 

[01 16] Target sequences up to 22 kb in length of the beta-globin gene cluster were amplified. The beta-globin targets 
were amplified less efficiently than lambda sequences of similar length that were at a single-copy level in a background 

35 of human placental DNA, either at the same overall concentration as the globin target or at a 1 0-fold lower concentration. 
These efficiency differences may reflect the relative sequence complexities, even though the lambda target was also 
in a human genomic background. The increased likelihood that long targets will contain sites sufficiently homologous 
to act as secondary primer annealing sites, and the presence of repetitive sequences in human genomic sequences, 
may explain why lambda targets were more efficiently amplified then beta-globin gene targets of comparable length. 

40 [0117] The problem of secondary priming sites also affected the choice of suitable primers for the amplification of 
beta-globin gene targets. Downstream primer RH1 053 (SEQ ID NO: 32), which hybridizes 5' to the beta-globin gene, 
was chosen because RH1 01 6 (SEQ ID NO: 31 ) t which hybridizes within exon 2 of the beta-globin gene, also hybridizes 
to a secondary sites within targets longer than 14 kb, resulting in multiple products. The upstream primer RH1020 
(SEQ ID NO: 26) resulted in multiple secondary products, as did the use of two other primers (not shown) within 1 00 

45 bases of RH1 020 (SEQ ID NO: 26). All three lie within an Alu repeat sequence. 

[0118] Results from amplifications of sequences up to 16 kb in length from the human neurofibromatosis- 1 gene 
also suggested that methods to insure primer specificity are crucial to efficient PCR amplification of long target se- 
quences. 

50 Example 5 

DNA Polymerase Combinations 

[01 19] To access the relative efficiency of various DNA polymerase combinations, amplification reactions were car- 
55 ried out essentially as described in Example 1 , above, using primers which amplify target sequences 22.8, 26.4, 29.9, 
and 33.9 kb in length. The DNA polymerase combinations compared were as follows: 

2.5 U rTth DNA polymerase + 0.02 U Vent^® DNA Polymeras 



15 



EP 0 669 401 B1 



2.5 U rTth DNA polymeras + 0.06 U Deep Vent^® DNA Polymerase 
3.15 U rTth DNA polymerase + 0.5 U Tma DNA Polymeras 



[0120] All reactions were carried out in 50 u.l, with 1 0 7 copies of lambda DNA, 0.45 u,M each primer and 1 .0-1 .1 mM 
5 Mg(OAc) 2 . Amplification reactions using the following specific conditions. 

[0121] Reactions using either rTth and Vent R <© or rTth and Deep Vent R ® DNA polymerases were carried out in 20 
mM triclne (pH 8.7), 85 mM KOAc, 10% glycerol, and 3% DMSO. Reactions using rTth and Tma DNA polymerases 
were carried out in 20 mM tricine (pH 8.7), 85 mM KOAc, 10% glycerol, and 2.5% DMSO. 

[0122] The temperature cycling profile was essentially as described In Example 1, above. An initial 13-minute ex- 
10 tension time was used for the first 9 cycles. The extension time was then increased to 1 3.5 minutes and increased 20 
seconds in each subsequent cycle for 1 8 cycles. Seven u.l aliquots of each reaction were loaded on a standard agarose 
gel along with 150 ng of the lambda/H/ndlll molecular weight marker. 

[0123] All templates (to 33.9 kb) were amplified using combinations of rTth DNA polymerase with Ventp® , Deep 
VentR® , and Tma DNA polymerases. The combination of 2.5 U rTth DNA polymerase and 0.02 U Verity© DNA Polymer- 
's ase amplified all targets with the greatest efficiency. 

Example 6 



PCR Amplification Kit 

20 

[0124] The reagents of the invention are suitable for inclusion in a kit for carrying out the PCR amplification of long 
target sequences. A kit contains at least a DNA polymerases mixture as described herein. Additional, optional, com- 
ponents include additional reagents and reaction containers used in the reactions as described below. \ 
[01 25] A preferred combination of DNA polymerases useful for amplifying both high copy and low copy targets con- 
25 sists of rTth and Ventp© DNA polymerases in a ratio of 2 units of rTth DNA polymerase to 0.08 units of VentR® DNA 
polymerase. Although, as shown below, the preferred polymerase concentration for the amplification of high copy 
targets is twice the preferred concentration for the amplification of low copy targets, the ratio of primary to secondary 
polymerases is the same. 

[0126] A reaction buffer suitable for inclusion in a kit consists of tricine, KOAc, glycerol, and DMSO in about the 
30 following concentrations: 



25 mM tricine (pH 8.7) 
80 mM KOAc 
10% (w/v) glycerol 
35 2.25% (v/v) DMSO 



[01 27] The term "about" is meant to encompass a standard plus or minus 1 0% manufacturing tolerance. For con- 
venience, the reaction buffer may be stored at a higher concentration and diluted before using. 
[0128] Amplifications are carried out using the preferred kit components essentially as described above, but using 
40 the preferred reaction conditions described below. These reagents and conditions have been used extensively and 
have been found to provide reliable amplification of long target sequences. 

[0129] Preferred conditions for the amplification of low copy (e.g. human genomic) targets (2.0 x 1 0 4 copies) in 1 00 
u.l reaction volumes are as follows. ■■■ 



45 25 mM tricine (pH 8.7) 

80 mM KOAc 

1 0% (wAO glycerol 

2.25% (v/v) DMSO 

0.2 mM each dNTP 
50 0.2 u.M each primer 

2 U rTth polymerase 

0.08 U Ventp® polymerase 

1.1 mM Mg(OAc) 2 

55 [01 30] Preferred cycling parameters for the amplification of low copy targets (>1 0 kb) are as follows: 



Denatu ration 94°C 1 minute 
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(continued) 



5 



20 Cycles 


94°C 


1 5 seconds 




68°C 


12 minutes 


17 Cycles 


94°C 


15 seconds 




68°C 


12 minutes with 15 second autoextend 


Final Extend 


72°C 


10 minutes 


Hold 


4°C 


indefinite 



10 

[0131] Preferred conditions for the amplification of high copy (e.g. cloned DNA) targets (2.0 x 1 0 7 copies) In 100 uJ 
reaction volumes are as follows. 

25 mM tricine (pH 8.7) 
15 80 mM KOAc 

1 0% (w/v) glycerol 

2.25% (v/v) DMSO 

0.2 mM each dNTP 

0.4 u.M each primer 
20 4 U rTth polymerase 

0.16 U Vent^® polymerase 

1.1 mM Mg(OAc) 2 

[0132] Preferred cycling parameters for the amplification of high copy targets (>1 0 kb) are as follows: 

25 



Denaturation 


94°C 


1 minute 


1 6 Cycles 


94°C 


15 seconds 




68°C 


10 minutes 


1 2 Cycles 


94°C 


15 seconds 




68°C 


10 minutes with 15 second autoextend 


Final Extend 


72°C 


10 minutes 


Hold 


4°C 


indefinite 



35 

[0133] A "hot-start" is achieved by separating reagents within the reaction tubes using Ampliwax™ PCR Gem 100 
wax beads, developed and manufactured by Hoffmann-La Roche and marketed by Perkin Elmer, Norwalk, CT. A 40 
uJ bottom reagent layer containing buffer (tricine, KOAc, glycerol, and DMSO), Mg(OAc) 2 , and the dNTP's is added to 
the reaction tube. A wax layer is established over the bottom layer by adding an Ampliwax™ PCR Gem 100 and 
40 incubating in a thermal cycler first at 80°C for 5 minutes, and then at 25°C for 5 minutes. A 60 u.l top reagent layer is 
then added containing buffer, the DNA polymerase mixture, the primers, and the target DNA. 

[0134] Samples are analyzed as described above on a 0.6% Agarose gel in 1 X TAE and 0.5 u.g/ml EtBrfor 1 .5 hours 
at7V/cm. 

45 SEQUENCE LISTING 

[0135] 

(1) GENERAL INFORMATION: 

50 

(i) APPLICANT: 

(A) NAME: F.Hoffmann-La Roche AG 

(B) STREET: Grenzacherstrasse 124 
55 (C) CITY: Basel 

(D) STATE: BS 

(E) COUNTRY: Switzerland 

(F) POSTAL CODE (ZIP): CH-4002 
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(G) TELEPHONE: (0)61 688 24 03 

(H) TELEFAX: (0)61 688 13 95 

(I) TELEX: 962292&965S42 hlrch 

5 (ii) TITLE OF INVENTION: Amplification of Long Nucleic Acid Sequences by PCR 

(iii) NUMBER OF SEQUENCES: 32 

(iv) COMPUTER READABLE FORM: 

10 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: Patent In Release #1 .0 P Version #1 .30 (EPO) 

15 

(2) INFORMATION FOR SEQ ID NO:1 : 
(i) SEQUENCE CHARACTERISTICS: 

20 (A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

25 (ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1: 

GGTGCTTTAT G AC TCTGCCG C 21 

30 (2) INFORMATION FOR SEQ ID NO:2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 
35 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

40 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:2: 

GCTGAAGTGG TGGAAACCGC 2 0 

(2) INFORMATION FOR SEQ ID NO:3: 
(i) SEQUENCE CHARACTERISTICS: 



45 



(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

so (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:3: 

55 

GCTCTTTCCG CTCTGCCATC 

(2) INFORMATION FOR SEQ ID NO:4: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEO ID NO:4: 



CGGCACTGGC AAGCAACTGA 



(2) INFORMATION FOR SEQ ID NO:5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:5: 

CCTCAACCGG ATCGAAGGCT 

(2) INFORMATION FOR SEQ ID NO:6: 
(i) SEQUENCE CHARACTERISTICS: 



(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid- 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:6: 

AGCGTGACGG TCACACCGTT 

(2) INFORMATION FOR SEQ ID NO:7: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:7: 

GACTCTGGCC ATCTGCTCGT 

(2) INFORMATION FOR SEQ ID NO:8: 
(i) SEQUENCE CHARACTERISTICS: 
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10 



25 



(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:8: 

GGACCTATCT GCCCGTTCGT ~ 20 



(2) INFORMATION FOR SEQ ID NO:9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEO ID NO:9: 

GCCACCAGTC .ATCCTCACGA 

(2) INFORMATION FOR SEQ ID NO:10: 
(i) SEQUENCE CHARACTERISTICS: 

30 (A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

35 (ji) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:10: 



GCAGCGTGAT TTCACGGTCG 20 

40 

(2) INFORMATION FOR SEQ ID NO:11: 
(i) SEQUENCE CHARACTERISTICS: 

45 (A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

so (ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:11 : 

G C TC AC AT AA CGTCCACGCA G 21 

55 (2) INFORMATION FOR SEQ ID NO:12: 

(i) SEQUENCE CHARACTERISTICS: 



20 
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(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

5 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEO ID NO:12: 

1Q GCCTCGCATA TCAGGAAGCA C 

(2) INFORMATION FOR SEQ ID NO:13: 

(i) SEQUENCE CHARACTERISTICS: 

15 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

20 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:13: 

25 G G G TG AC G AT GTGATTTCGC C 

(2) INFORMATION FOR SEQ ID NO:14: 

(i) SEQUENCE CHARACTERISTICS: 

30 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

35 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:14: 

GGCATTCCTA CGAGCAGATG GT 

40 

(2) INFORMATION FOR SEQ ID NO:15: 
(i) SEQUENCE CHARACTERISTICS: 

45 (A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

so (ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:15: 

GGTCTGCCTG ATGCTCCACT 

55 (2) INFORMATION FOR SEQ ID NO:16: 

(i) SEQUENCE CHARACTERISTICS: 
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10 



(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDN ESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:16: 

GTCGGACTTG TGCAAGTTGC C 

(2) INFORMATION FOR SEQ ID NO:17: 

(i) SEQUENCE CHARACTERISTICS: 

is (A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

20 (ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:17: 

GCATGGATTC TGTCGACCCA C 

25 (2) INFORMATION FOR SEQ ID NO:1 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 
30 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

35 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:18: 



GAGAACCACC GAGCCTGATG 

(2) INFORMATION FOR SEQ ID NO:19: 
(i) SEQUENCE CHARACTERISTICS: 



40 



50 



(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:19: 



AGCATTGGCC GTAAGTGCGA TT 

(2) INFORMATION FOR SEQ ID NO:20: 
55 (j) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 
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(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:20: 

GGCCTTGTTG ATCGCGCTTT GA 



(2) INFORMATION FOR SEQ ID NO:21 : 1 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:21: 

TGTCACGCCT GCCTGTTGCT T 

(2) INFORMATION FOR SEQ ID NO:22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:22: 

GCGTTCCGCA C GAG AT AC AT G 

(2) INFORMATION FOR SEQ ID NO:23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:23: 



AGAAACAGGC GCTGGGCATC 



(2) INFORMATION FOR SEQ ID NO:24: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 
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(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:24: 

CGGGAACGGC TTTACCTCTT C 

(2) INFORMATION FOR SEQ ID NO:25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:25: 



CTGCTGAAAG AGATGCGGTG G 



(2) INFORMATION FOR SEQ ID NO:26: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:26: 



CTGCAGTCCC AGCTATTCAG G 

(2) INFORMATION FOR SEQ ID NO:27: 
(i) SEQUENCE CHARACTERISTICS: 



(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:27: 

CGAGTAAGAG ACCATTGTGG CAG 

(2) INFORMATION FOR SEQ ID NO:28: 
(i) SEQUENCE CHARACTERISTICS: 



(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
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(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:28: 

TTGAGACGCA TGAGACGTGC AG 22 

(2) INFORMATION FOR SEQ ID NO:29: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:29: 

CCTCAGCCTC AGAATTTGGC AC 22 

(2) INFORMATION FOR SEQ ID NO:30: 
(i) SEQUENCE CHARACTERISTICS: 



(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

30 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:30: 

GAGGACTAAC TGGGCTGAGA CC 22 

(2) INFORMATION FOR SEQ ID NO:31: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:31 : 

CAGCTCACTC AGTGTGGCAA AG 22 

(2) INFORMATION FOR SEQ ID NO:32: 
(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 22 base pairs 
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(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:32: 

GCACTGGCTT AGGAGTTGGA CT 2 2 



Claims 



A DNA polymerase composition for the polymerase chain reaction amplification of nucleic acid sequences longer 
than 10 kb consisting of a combination of a first DNA polymerase and a second DNA polymerase, wherein said 
first DNA polymerase is Thermus thermophilus DNA polymerase, and wherein said second DNA polymerase is 
selected from the group of DNA polymerases consisting of Thermococcus fitoralis DNA polymerase, Pyrococcus 
species GB-D DNA polymerase, and Thermotoga maritima DNA polymerase and wherein said DNA polymerase 
composition consists of about 0.8-2.5 units of first DNA polymerase for each 0.015-0.15 units of second DNA 
polymerase. 

2. A DNA polymerase composition of claim 1 whereas the nucleic acid sequences to be amplified are in the 23-42 
kb range. 

3. The DNA polymerase composition of claim 1 and 2, wherein said second DNA polymerase is Thermococcus litoralis 
25 DNA polymerase. * ■ 

4. The DNA polymerase composition as claimed in any of claims 1 to 3, wherein said DNA polymerase composition 
consists of about 2 units of first DNA polymerase for each 0.08 units of second DNA polymerase. 

30 5. A reaction buffer for the polymerase chain reaction amplification of nucleic acid sequences longer than 10 Kb 
comprising a DNA polymerase composition as claimed in any of claims 1 to 4 and further comprising about 25 mM 
tricine, 80 mM KOAc, 10% (w/v) glycerol, and 2.25% (v/v) DMSO. 

6. A process for amplifying a nucleic acid using a DNA polymerase composition as claimed in any one of daims 1 to 4. 

35 

7. Use of a DNA polymerase composition as claimed in any one of claims! to 4 for amplifying a nucleic acid. 

8. A kit comprising a DNA polymerase composition as claimed in any one of claims 1 to 4. 

40 g. As kit of claim 8 further comprising a reaction buffer for the polymerase chain reaction amplification of long nucleic 
acid sequences, wherein said reaction buffer preferably comprises about 25 mM tricine, 80 mM KOAc, 10% (w/v) 
glycerol, and 2.25% (v/v) DMSO. 

45 Patentanspruche 

1. DNA-Polymerase-Zusammensetzung zur Amplifikation von Nucleinsauresequenzen mit einer Laxige von mehrals 
10 kb mithiife der Poiymerasekettenreaktion, bestehend aus einer Kombination einer ersten DNA- Polymerase und 
einer zweiten DNA- Polymerase, wobei die erste DNA- Polymerase eine DNA- Polymerase aus Thermus thermo- 
so phitus ist und wobei die zweite DNA-Polymerase ausgewahlt ist aus der Gruppe von DNA- Polymerase n bestehend 

aus DNA-Polymerase aus Thermococcus litoralis, DNA-Polymerase aus der Pyrococcus- Art GB-D und DNA-Po- 
lymerase aus Thermotoga maritima, und wobei die DNA-Polymerase-Zusammensetzung aus etwa 0,8 bis 2,5 
Einheiten einer ersten DNA-Polymerase pro 0,015 bis 0,15 Einheiten einer zweiten DNA-Polymerase besteht. 

55 2. DNA-Polymerase-Zusammensetzung nach Anspruch 1, wobei die zu amplifizierenden Nucleinsauresequenzen 
im Bereich von 23 bis 42 kb liegen. 

3. DNA-Polymerase-Zusammensetzung nach Anspruch 1 oder 2, wobei die zweite DNA-Polymerase die DNA-Po- 
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lymerase aus Thermococcus litoralis ist. 
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4. DNA-Polymerase-Zusammensetzung nach einem der Anspruche 1 bis 3, wobei die DNA-Polymerase-Zusammen- 
setzung aus etwa 2 Einheiten einer ersten DNA-Polymerase pro 0,08 Einheiten einer zweiten DNA-Polymerase 

5 besteht. 

5. Reaktionspuffer fur die Amptif ikation von Nucleinsauresequenzen mit einer LSnge von mehr als 1 0 kb mithiife der 
Poiymerasekettenreaktion, umfassend eine DNA-Polymerase-Zusammensetzung nach einem der Anspruche 1 
bis 4 und weiterhin etwa 25 mM Tricin, 80 mM KOAc, 1 0 Gew.-% Giycerin und 2,25 Vol.-% DMSO. 

10 

6. Verfahren zur Amplif ikation einer Nucleinsfiure unter Verwendung einer DNA-Polymerase-Zusammensetzung 
nach einem der Anspruche 1 bis 4. 

7. Verwendung einer DNA-Polymerase-Zusammensetzung nach einem der Anspruche 1 bis 4 zur Amplifikation einer 
15 Nucleinsaure. 

8. Kit umfassend eine DNA-Polymerase-Zusammensetzung nach einem der Anspruche 1 bis 4. 

9. Kit nach Anspruch 8, weiterhin umfassend einen Reaktionspuffer fur die Amplifikation von langen Nucleinsaure- 
20 sequenzen mithiife der Poiymerasekettenreaktion, wobei der Reaktionspuffer vorzugsweise etwa 25 mM Tricin, 

80 mM KOAc, 1 0 Gew.-% Glycerin und 2,25 Vol.-% DMSO enthalt. 



Revendlcatlons 

25 

1. Composition d'ADN polymerase pour la I' amplification, par la reaction d'amplification en chaTne par polymerase, 
de sequences d'acide nucleique d'une longueur superieure a 1 0 kb, consistant en une association d'une premiere 
ADN polymerase et d'une seconde ADN polymerase, dans laquelle ladite premiere ADN polymerase est I'ADN 
polymerase de Thermus thermophilus et dans laquelle ladite seconde ADN polymerase est choisie dans le group e 
30 d'ADN polymerases consistant en I'ADN polymerase de Thermococcus litoralis, I'ADN polymerase de Pyrococcus 

species GB-D et I'ADN polymerase de Thermotoga maritima, et dans laquelle ladite composition d'ADN polyme- 
rase consiste en environ 0,8-2,5 unites de la premiere ADN polymerase pourchaque 0,015-0,15 unite de seconde 
ADN polymerase. e 

35 2. Composition d'ADN polymerase de la revendication 1 , dans laquelle les sequences d'acide nucleique a amplifier 
sont dans la plage de 23 a 42 kb. 

3. Composition d'ADN polymerase de la revendication 1 et la revendication 2, dans laquelle ladite seconde ADN 
polymerase est I'ADN polymerase de Thermococcus litoralis. 

40 

4. Composition d'ADN polymerase telle que revendiquee dans I'une quelconque des revendications 1 a 3, dans 
laquelle ladite.composition d'ADN polymerase consiste en environ 2 unites de la premiere ADN polymerase pour 
chaque 0,08 unite de seconde ADN polymerase. 

45 5. Tampon de reaction pour ('amplification, par la reaction d'amplification en chaTne par polymerase, de sequences 
d'acide nucleique d'une longueur superieure a 10 kb, com pre n ant une composition d'ADN polymerase telle que 
revendiquee dans I'une quelconque des revendications 1 a 4, et comprenant en outre environ 25 mM de tricine, 
80 mM de KOAc, 1 0 % (p/v) de glycerol et 2,25 % (p/v) de DMSO. ? 

so 6. Precede pour ('amplification d'un acide nucleique a I'ide d'une composition d'ADN polymerase telle que revendi- 
quee dans i'une quelconque des revendications 1 a 4. 

7. Utilisation d'une composition d'ADN polymerase telle que revendiquee dans I'une quelconque des revendications 
1 a 4, pour ('amplification d'un acide nucleique. 
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Necessaire comprenant une composition d'ADN polymerase telle que revendiquee dans i'une quelconque des 
revendications 1 a 4. 
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Necessaire de la revendication 8, comprenant un tampon de reaction pour ('amplification, par la reaction d'ampli- 
fication en chaTne par potymeras , de longues sequences d'acide nucleique, dans lequel I drt tampon de reaction 
comprend environ 25 mM de tricine, 80 mM de KOAc, 10 % (p/v) de glycerol et 2,25 % (v/v) d DMSO. 
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